19. Notebook + Quiz: Multicollinearity & VIFs
Workspace
This section contains either a workspace (it can be a Jupyter Notebook workspace or an online code editor work space, etc.) and it cannot be automatically downloaded to be generated here. Please access the classroom with your account and manually download the workspace to your local machine. Note that for some courses, Udacity upload the workspace files onto https://github.com/udacity , so you may be able to download them there.
Workspace Information:
- Default file path:
- Workspace type: jupyter
- Opened files (when workspace is loaded): n/a
SOLUTION:
- It appears that the predictor variables are correlated with one another.
- The variables that appear to be most correlated are the number of bedrooms and bathrooms.
SOLUTION:
- As the number of bathrooms increases, we predict the price to increase.
- As the area of the home increases, we predict the price to increase.
SOLUTION:
We should remove either bedrooms or bathrooms, because they both have VIFs greater than 10.SOLUTION:
- All VIFs are now below 10.
- All of the coefficients are now positive, as we would expect.
- To three digits the Rsquared value stayed the same, suggesting we didn't really need both bedrooms and bathrooms in the model.